Extraction of Excitation Information from Speech and Its Applications for Expressive Speech Processing

Authors

  • Sudarsana Reddy Kadiri
  • B. Yegnanarayana
Abstract

Through the speech production mechanism, speech with different voice qualities, such as different phonations, emotions, expressive singing and other paralinguistic sounds, is also produced. Most of these sounds owe their characteristics mainly to the excitation component (vibration of the vocal folds at the glottis), whereas the dynamic vocal tract system primarily conveys the message. Hence, excitation source processing is significant, especially for the analysis, detection and representation of expressive voices. Most existing methods for extracting excitation source information are not reliable when applied to expressive voices, mainly because of significant source-system coupling. There is therefore a need for new signal processing methods that can capture the dynamic variations in the excitation source, so that different types of sounds can be better analyzed and represented. The objective of this work is to derive new signal processing methods that extract the excitation source information directly from the signal, and then to investigate the significance of this information for the analysis and detection of various expressive voices. Towards this, some excitation source features are extracted using recently proposed signal processing methods, and the significance of these features is studied in emotional speech analysis and recognition. Studies are currently in progress on representing the excitation source in the form of an impulse-like sequence.

Similar resources

A Comparative Study of Gender and Age Classification in Speech Signals

Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...

Advances in Glottal Analysis and its Applications

From artificial voices in GPS to automatic systems of dictation, from voice-based identity verification to voice pathology detection, speech processing applications are nowadays omnipresent in our daily life. By offering solutions to companies seeking for efficiency enhancement with simultaneous cost saving, the market of speech technology is forecast to be particularly promising in the next ye...

Diagnostic Accuracy of the Photographic Expressive Persian Grammar Test to Identify 4-6 Years Old Children With Developmental Language Disorder

Objectives: Accurate diagnosis of Persian children with Developmental Language Disorder (DLD) is regarded as a challenge for Speech and Language Pathologists (SLPs) in Iran because of the lack of formal linguistic tests that can reliably distinguish language-impaired children from Typically-Developing (TD) children. This study aimed to investigate the diagnostic accuracy of the photographic exp...

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

Discourse Structures of Condolence Speech Act

Condolence is part of Austin’s expressive speech act and is related to Searle’s behabitives illocutionary act. Although a theoretically sound issue in pragmatics, condolence speech act has not been investigated as much as other speech acts in discourse-related studies. This paper aims at investigating interjections and intensifiers while performing condolence speech act among Persian and Englis...

Publication date: 2017